Some Concept of Dispersion Measure for Categorical Data.

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DISC: Data-Intensive Similarity Measure for Categorical Data

The concept of similarity is fundamentally important in almost every scientific field. Clustering, distance-based outlier detection, classification, regression and search are major data mining techniques which compute the similarities between instances and hence the choice of a particular similarity measure can turn out to be a major cause of success or failure of the algorithm. The notion of s...

متن کامل

An association-based dissimilarity measure for categorical data

In this paper, we propose a novel method to measure the dissimilarity of categorical data. The key idea is to consider the dissimilarity between two categorical values of an attribute as a combination of dissimilarities between the conditional probability distributions of other attributes given these two values. Experiments with real data show that our dissimilarity estimation method improves t...

متن کامل

Transiogram: A spatial relationship measure for categorical data

Categorical geographical variables are normally classified into multinomial classes which are mutually exclusive and visualized as area-class maps. Typical categorical variables such as soil types and land cover classes are multinomial and exhibit complex interclass relationships. Interclass relationships may include three situations: cross-correlation (i.e. interdependency), neighbouring situa...

متن کامل

Survey on Clustering Algorithm and Similarity Measure for Categorical Data

Learning is the process of generating useful information from a huge volume of data. Learning can be either supervised learning (e.g. classification) or unsupervised learning (e.g. Clustering) Clustering is the process of grouping a set of physical objects into classes of similar object. Objects in real world consist of both numerical and categorical data. Categorical data are not analyzed as n...

متن کامل

Clustering Categorical Data Using an Extended Modularity Measure

Newman and Girvan [12] recently proposed an objective function for graph clustering called the Modularity function which allows automatic selection of the number of clusters. Empirically, higher values of the Modularity function have been shown to correlate well with good graph clustering. In this paper we propose an extended Modularity measure for categorical data clustering; first, we establi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Japanese journal of applied statistics

سال: 1998

ISSN: 0285-0370,1883-8081

DOI: 10.5023/jappstat.27.83